NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Worm Perturb-Seq: massively parallel whole-animal RNAi and RNA-seq

https://doi.org/10.1038/s41467-025-60154-0

Zhang, Hefei; Li, Xuhang; Song, Dongyuan; Yukselen, Onur; Nanda, Shivani; Kucukural, Alper; Li, Jingyi Jessica; Garber, Manuel; Walhout, Albertha_J M (December 2025, Nature Communications)

Free, publicly-accessible full text available December 1, 2026
Leadership at the Intersection of Statistics & Genomics: A COPSS-NISS Leadership Webinar with Drs. Rafael Irizarry and Mingyao Li

https://doi.org/10.1007/s12561-024-09455-4

Li, Jingyi Jessica (December 2024, Statistics in Biosciences)

Abstract In a COPSS-NISS webinar focused on leadership at the intersection of statistics and genomics, esteemed panelists Drs. Rafael Irizarry and Mingyao Li shared their leadership journeys and provided insights into this interdisciplinary field to inspire future leaders. They discussed the value of statistics in distinguishing signal from noise in the artificial intelligence (AI) era, the strengths of statisticians in ensuring rigor and robustness in genomics research, and the trade-offs between model expressiveness and interpretability. Additionally, they offered advice on how junior faculty can seek collaborations and increase their visibility, balance staying current with technological advancements, while developing methods carefully and thoroughly, and best practices for collaborating with domain experts. The recording of the webinar is available athttps://www.youtube.com/watch?v=t6SsAoh95ig.
more » « less
Free, publicly-accessible full text available December 1, 2025
Comment on “Data Fission: Splitting a Single Data Point” Data Fission for Unsupervised Learning: A Discussion on Post-Clustering Inference and the Challenges of Debiasing

https://doi.org/10.1080/01621459.2024.2412191

Wang, Changhu; Ge, Xinzhou; Song, Dongyuan; Li, Jingyi Jessica (January 2025, Journal of the American Statistical Association)

Free, publicly-accessible full text available January 2, 2026
Statistical method scDEED for detecting dubious 2D single-cell embeddings and optimizing t-SNE and UMAP hyperparameters

https://doi.org/10.1038/s41467-024-45891-y

Xia, Lucy; Lee, Christy; Li, Jingyi Jessica (December 2024, Nature Communications)

Abstract Two-dimensional (2D) embedding methods are crucial for single-cell data visualization. Popular methods such as t-distributed stochastic neighbor embedding (t-SNE) and uniform manifold approximation and projection (UMAP) are commonly used for visualizing cell clusters; however, it is well known that t-SNE and UMAP’s 2D embeddings might not reliably inform the similarities among cell clusters. Motivated by this challenge, we present a statistical method, scDEED, for detecting dubious cell embeddings output by a 2D-embedding method. By calculating a reliability score for every cell embedding based on the similarity between the cell’s 2D-embedding neighbors and pre-embedding neighbors, scDEED identifies the cell embeddings with low reliability scores as dubious and those with high reliability scores as trustworthy. Moreover, by minimizing the number of dubious cell embeddings, scDEED provides intuitive guidance for optimizing the hyperparameters of an embedding method. We show the effectiveness of scDEED on multiple datasets for detecting dubious cell embeddings and optimizing the hyperparameters of t-SNE and UMAP.
more » « less
Free, publicly-accessible full text available December 1, 2025
Response to "Neglecting normalization impact in semi-synthetic RNA-seq data simulation generates artificial false positives" and "Winsorization greatly reduces false positives by popular differential expression methods when analyzing human population samples"

https://doi.org/10.1186/s13059-024-03232-8

Ge, Xinzhou; Li, Yumei; Li, Wei; Li, Jingyi Jessica (December 2024, Genome Biology)

Abstract Two correspondences raised concerns or comments about our analyses regarding exaggerated false positives found by differential expression (DE) methods. Here, we discuss the points they raise and explain why we agree or disagree with these points. We add new analysis to confirm that the Wilcoxon rank-sum test remains the most robust method compared to the other five DE methods (DESeq2, edgeR, limma-voom, dearseq, and NOISeq) in two-condition DE analyses after considering normalization and winsorization, the data preprocessing steps discussed in the two correspondences.
more » « less
Free, publicly-accessible full text available December 1, 2025
Dissecting Gene Expression Heterogeneity: Generalized Pearson Correlation Squares and the K -Lines Clustering Algorithm

https://doi.org/10.1080/01621459.2024.2342639

Li, Jingyi Jessica; Zhou, Heather J; Bickel, Peter J; Tong, Xin (May 2024, Journal of the American Statistical Association)

Full Text Available
Decoding heterogeneous single-cell perturbation responses

https://doi.org/10.1038/s41556-025-01626-9

Song, Bicna; Liu, Dingyu; Dai, Weiwei; McMyn, Natalie F; Wang, Qingyang; Yang, Dapeng; Krejci, Adam; Vasilyev, Anatoly; Untermoser, Nicole; Loregger, Anke; et al (March 2025, Nature Cell Biology)

Free, publicly-accessible full text available March 1, 2026
Hierarchical Neyman-Pearson Classification for Prioritizing Severe Disease Categories in COVID-19 Patient Data

https://doi.org/10.1080/01621459.2023.2270657

Wang, Lijia; Wang, Y. X.; Li, Jingyi Jessica; Tong, Xin (January 2024, Journal of the American Statistical Association)

Full Text Available
scDesign3 generates realistic in silico data for multimodal single-cell and spatial omics

https://doi.org/10.1038/s41587-023-01772-1

Song, Dongyuan; Wang, Qingyang; Yan, Guanao; Liu, Tianyang; Sun, Tianyi; Li, Jingyi Jessica (February 2024, Nature Biotechnology)

Full Text Available
A Python Package itca for Information-Theoretic Classification Accuracy: A Criterion That Guides Data-Driven Combination of Ambiguous Outcome Labels in Multiclass Classification

https://doi.org/10.1089/cmb.2023.0191

Zhang, Chihao; Zhang, Shihua; Li, Jingyi Jessica (November 2023, Journal of Computational Biology)

Full Text Available

« Prev Next »

Search for: All records